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ARCHITECTURE 

CROSS REFERENCE TO RELATED APPLICATION 
This application is related to Application 
No, (L13, 12-0172/00-629) filed on even date herewith for 
"Optimization of Comparator Architecture" by Serge j B. 
Gashkov^ Alexander E, Andreev and Aiguo Lu and assigned 
to the same assignee as the present invention, which 
application is incorporated herein by reference. 

FIELD OF THE INVENTION 
This invention relates to digital adders, and 
particularly to adders for use in application specific 
integrated circuits (ASICs) . 

BACKGROUND OF THE INVENTION 
Adders are widely used in ASICs and represent 
one of the most common and basic circuit functions of 
general purpose digital computers and processors. 
Adder based circuits include adders, subtracters, 
adder- subtracters, incrementors, decrement or s , 

incrementor-decrementors, and absolute value 
calculators, to name a few. Nearly every datapath 
module of nearly every digital IC includes adder 
based circuits. Thus, adders are crucial to the 
operation of computers, processors and controllers. 

Each element on a chip, be it an element of an 
adder or some other device, is derived from a library 
of cells, the library being technology dependent 
based on the processing technology used to fabricate 
the IC chip. Each cell requires some space (area) on 
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the chip, and the cells forming the element require 
some depth to the chip. Consequently, the elements 
formed by the cells require some minimum amount of 
volume on the chip. 
5 Most adders are implemented to perform a Boolean 

function such that S_n = A_n+B_n, where Sjn is the 
digital output and A_n and B_n are the digital 
inputs. Most adders are composed of AND, OR and 
Exclusive-NOR (XNOR) elements. However, these 

10 elements often require considerable space and depth, 
and signal propagation through the element may cause 
timing delays. 

As integrated circuit processing continues to 
advance, the need increases for smaller, faster 

15 adders. The present invention is directed to this 
need. 

The aforementioned application of Gashkov et al. 
describes a comparator architecture for ICs based on 
a Fibonacci series. The resulting circuit is smaller 
20 and has less depth, and hence less delay, than a 
corresponding comparator of traditional design. The 
present invention extends that concept to adders for 
ICS. 

SUMMARY OF THE INVENTION 
25 In accordance with the present invention, an 

adder is implemented as three modules, an input 
module, a carry module, and an output module. The 
input and/or output modules are optimized based on a 
global analysis of the Boolean representations of 



functions to optimize circuit area and depth and 
reduce delay. The carry module is based on a 
Fibonacci series. 

In one embodiment, an adder based circuit is 
embodied in an integrated circuit. The adder based 
circuit comprises an input module that receives 
inputs A[i] and B[i] to generate U[i] = A[i]&B[i] and 
either V[i] = A[i]vB[i] or V[i] = A[i]eB[i]. A carry 
module is responsive to U[i] and V[i] to generate 
carry functions. An output module is responsive to 
the U[i], V[i] and carry functions to provide an 

output function S = J^TS[i] = J^^'A\i] + £2'5[z] . The 

/=0 1=0 t-0 

carry module has a minimal depth defined by a 
recursive expansion of at least one carry function 
associated with the carry module based at least in 
part on a variable k, where k = F_l and n-k = F__{1-1} 
and where 1 satisfies F_l < n = F_{1+1}, {F_l} is a 
Fibonacci series and n is the number of bits of at 
least one of U[i] and V[i] . 

In various embodiments, the adder based circuit 
functions as a subtractor, adder-subtractor , 
increment or , decrementor, increment-deer ementor and 
absolute value calculator, depending on elements 
applied to the inputs and outputs. 

In some embodiments, an adder is designed for an 
integrated circuit. At least one output function of 
a carry module of the adder is defined in terms of a 
Fibonacci series. The output functions are 



recursively expanded to find a minimum parameter of 
the Fibonacci series. 

In one embodiment, recursive functions are 
defined as h'_l = h_l (U [k+1] ,U [k+2] , V[k:+2] , . . . , 
U[k4-1] ,V[k+l] ) and v'_l = V[k+1] &V[k+l] , based 
on the carry functions, where k=F_l and n-k=F_{l~l} , 
1 satisfies F_l < n = F_{1+1}, {F_l} is the Fibonacci 
series defined recursively from the equality 
F_{1+1}=F{1}+F_{1+1} and n is the number of bits of 
an input to the carry module. The recursive 
functions are recursively expanded to minimize 1- 

In some embodiments, negations are optimally 
distributed in the carry module by identifying a set 
of delay vectors, recursively comparing the delay 
vectors to derive a set of minimum vectors, and 
selecting a vector with a minimum norm from the set 
of minimum vectors. 

In other embodiments the depth of the adder is 
minimized by recursively expanding expressions 

DH_n = min{max(DH_{n-k}+l,Dh_k+2,DH_k} } and 
Dh_n = min{max(Dh_{n-k}+l,Dh_k+2} }, 
where DH_k and Dh_k are based on vectors of depths 
U[i] for 0 < i < k-1 and DH_{n-k} and Dh_{n~k} are 
based on vectors for k < i < n-1. A value of k is 
selected based on a minimum of at least one of DH_n 
and Dh_n. 

In other embodiments, the fanout depth of the 
adder based circuit is minimized by defining 
recursive functions 



H^i_k = h^{i+l}_{k-l}vv^{i+l}_{k-l}6ch^i_l 
and 

v^i_l = v^{i+l}_{k^l}&v^i_l 

based on the output function, 
where k=F_l and n-k=F_{l-l}, 1 satisfies 
F l<n=F_{l-M}, {F_l} is the Fibonacci series defined 
from the equality F_{ 1+1 }=F{ 1 }+F_{ 14-1 } . The 
recursive functions are recursively expanded to 
minimize 1. 

In other embodiments, the invention is manifest 
in a computer readable program containing code that, 
when executed by a computer, causes the computer to 
perform the process steps to design an adder based 
circuit for an integrated circuit based on a Fibonacci 
series . 

BRIEF DESCRIPTION OF THE DRAWINGS 
FIG. 1 is a functional diagram of a basic adder 
circuit according to one embodiment of the present 
invention. 

FIGS. 2A and 2B are functional diagrams of the 
input and output modules of the circuit shown in FIG. 
1. 

FIG- 3 is a functional diagram of a carry module 
according to one embodiment of the present invention 
for use in the circuit shown in FIG. 1. 

FIG. 4 is a functional diagram of a subcircuit 
used in the carry module shown in FIG. 3. 



FIG. 5 is a functional diagram of a carry module 
according to another embodiment of the present 
invention for use in the circuit shown in FIG. 1. 

FIG. 6 is a functional diagram of one embodiment 
of a subcircuit used in the carry module shown in 
FIG. 5. 

FIGS. 7A and 7B are functional diagrams of 
circuits used in the subcircuit of FIG. 8, 

FIG. 8 is a functional diagram of another 
embodiment of a subcircuit used in the carry module 
shown in FIG. 5. 

FIGS. 9 and 10 are functional diagrams of other 
embodiments of carry modules used in the adder based 
circuit illustrated in FIG. 1 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

In the following, Section 1 describes Boolean 
representations of seven different adder-based 
circuits are described to demonstrate how they may be 
implemented from a common adder circuit by modifying 
the input and/or output modules of an adder according 
to the present invention. In Section 2, 

implementation of an adder according to the present 
invention is described, and Section 3 describes 
optimization techniques for the adder. 
1. BOOLEAN REPRESENTATIONS OF ARITHMETIC CIRCUITS 
1.1. Adder 

An adder is a circuit with inputs 
A[0] ,B[0] , . . . ,A[n-l] ,B[n-l] and outputs 



S [0] , . . . , S [n] , such that the sum of numbers A = ^2'A[i] 

1=0 

and B = ^2'B[i] is equal to S = J]2'5[/]. In accordance 

1=0 (=0 

with the present invention, an adder is implemented 
as three modules: an input module, a carry module, 
and an output module. The input and/or output 
modules are optimized based on a global analysis of 
the Boolean representations of functions to optimize 
circuit area and depth and reduce delay. The carry 
module is based on a Fibonacci series. 

The input module receives inputs A[i] and B[i] 
to generate U[i] = A[i]&B[i] and either V[i] = 
A[i]vB[i] or V[i] = A[i]©B[i]. The carry module is 
responsive to U[i] and V[i] to generate a carry 
functions W[i] . The output module is responsive to 
the U[i], V[i] and W[i] to provide an output function 
S[i] . 

Considering first a special class of subtracter 
circuit that subtracts an n-bit number B having bits 
(B [n-l] , . . . , B [0] ) from an (n+l)-bit number A having 
bit A[n]=l and bits (A[n-1],..., A[0]), so A > B, the 
Boolean representation is: 

X2'A[i] + r -X2'5[/] = 2"^' -\-f^2'-nA[i]-f^2'B\i] 

1=0 /=0 f=0 /=o 

= 2"'' -1-^2' S[i] 

1-0 

1=0 



where symbol -i denotes logical negation. This 
special circuit can be obtained from an adder circuit 
by connecting inverters to inputs A[0],..., A[n-1] of 
the adder (but not to inputs B[0],..., B[n-1]) and by 
connecting inverters to outputs S [0] , . . . , S [n] . The 
last output, -iS[n], is equal to 1 if and only if 
number A, having digits A[0],..., A[n], is greater 
than or equal to number having digits B[0],-.., 

B[n-1] , that is, A > B. 

In the aforementioned Gashkov et al. 
application, we describe a comparator that provides 
two outputs LT_LE and GE_GT . If the number of bits 
in A is greater than or equal to the number of bits 
in B, GE_GT = 1. Therefore, if GE_GT = 1, A > B. In 
an adder, adding digit-by-digit numbers A and B 
generate a sequence of carries W [1] , . . . , W [n+1] , which 
is recursively calculated as 

W[i + 1] = maj (A[i] ,B[i] ,W[i] ) = U [i] vV[i] &W [i] , 
where maj(A, B,W) is a majority function, namely 
maj(A,B,W) = A&BvA&WvB&W = UvV&W = U©P&W, 
where U=A&B, V=AvB, P=A©B, W[0]=0, W[1]=U[1], and 
S[i] = A[i]©B[i]©W[i] = P[i]©W[i], P[i] = A[i]©B[i], 
for 1 < i < n and S[n] = W[n]. (The function A©B can 
also be represented by XOR(A,B). The symbol & 
denotes logical conjunction and the symbol v denotes 
logical disjunction.) By induction, 

W[n] (A[0] ,B[0] , , . . ,A[n-l] ,B[n-l] ) 

= h n(U[0],U[l],V[l],...,U[n-l],V[n-l]), 



U[i] = A[i]&B[i], 

V[i] = A[i]vB[i], 0 < i < n-l, where the function 
h_n is determined from the equality 

h_n(U[0] ,U[1] ,V[i] , . . .,U[n-l] ,V[n-l] ) 

= U[n-l]vV[n-l]&U[n-2]vV[n-l] &V[n-2] &U [n-3] 
V. . .vV[n-l]&V[n-2] . . .V[i]&U[0] . 
(It will be appreciated that the above equality is 
the same as in the case of the comparator described 
in the aforementioned Gashkov et al. application.) 
Usually V[i] will be realized by the function 

A[i]eB[i] . 

The carries are calculated as: 

W[i+1] = W[i+1] (A[0],B[0],...,A[i],B[i]) 

= h_{i+l} (U[0],U[l],V[l],...,U[i],V[i]), 

1 < i < n-l, and 

W[l] (A[0] ,B[0] ) =h_{i}{U[0]) 
= U[0] 

= A[0] &B[0] . 

The standard adder will have an additional input CI 
(input carry) and an additional output CO (output 
carry) . The remaining outputs will realize all 

n-l n-l 

digits of the sum ^2'A[i] + Y,^'B\i] + CI . To include the 

1=0 i=0 

input carry signal into the adder, it is necessary to 
change the equality from S[0] = A[0]©B[0] to 
S[0] = A[0]®B[0]©CI and from W[l] = A[0]&B[0] to 
W[l] = maj (A[0] ,B[0] ,CI) 
= U[0]vV[0]&CI 
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= h_2{CI,U[0],V[0]) 

and from 

W[i+1] = h_{i+l} {U[0] .U[l] ,V[1] , - . .,U[i] ,V[i] ) , 
where 1 < i < n-1 

to 

W[i+1] = h_{i+2} (CI,U[0] ,V[0] , . . . ,U[i] ,V[i] ) , 
where i = l,...,n-l. 

In particular, the carry at output CO will be 

W[n] = h_{n+l}(CI,U[0],V[0],...,U[n-l],V[n-l]). 
To reduce circuit depth, and hence signal 

delays, it is preferred to employ an element composed 

of an OR element and inverter in place of XNOR 

elements: XNOR{a,b) = OR (NOT (a) , b) ) . 

1.2. Subtractor 

A subtractor circuit is implemented based on the 

following identities: 

Y^'Aii] -^TB[i] ^ C/ = 2" - 1 " f^T-^li] - 1;2'5[/] - CI 

t=0 1^0 1=0 1=0 

= 2""'-l-^2'S[/]-2" 

1=0 

= X2'-5'm-2" 

1=0 

which =|^2'-,5[/]-2", if S[n]=l 
and ^J^2'^S\i], if S[n]=0. 

1=0 

The subtractor can be obtained from an adder by 
attaching inverters to inputs A[0] , . . . , A[n-1] (but 
not to inputs B [0] , . . . , B [n-1] , CI) and by attaching 
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inverters to outputs S [0] , . . . , S [n] . Topologically 
the circuit can be kept the same if XOR elements on 
outputs of the circuit are replaced with XNOR instead 
of adding the inverters . As mentioned above, 
however, it is preferred to employ OR(NOT(a),b) in 
place of the XNOR element. 

1.3. Adder-Subtracter 

An adder-subtracter circuit is one having an 
additional input ADD_SUB. If ADD_SUB = 0, the 
circuit operates as an adder; if ADD__SUB = 1, the 
circuit operates as a subtracter. To construct the 
circuit inputs A[i], i = 0,...,n-l and all outputs of 
the adder are connected through additional XOR 
elements, with each additional XOR element having a 
common input ADD_SUB. If ADD__SUB = 0, the circuit 
will operate in the adder function because the 
additional XOR elements will simply pass the state of 
the input A[i] and output S[i] bits. If ADD_SUB = 1, 
the circuit will operate as a subtracter because the 
additional XOR elements will operate as inverters. 
The complexity of the circuit will increase by 2n + 1 
and the depth will increase by 2 when compared with 
an adder. 

1.4. Incrementer 

An incrementor with n inputs A[0] , . . . , A[n-1] can 

be represented as ^2'S[i] = ^2'A[i] = l . Therefore an 

incrementor is a simplification of an adder, with the 
second operand is equal to constant 1. If numbers A 
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and B are added in sequence a sequence of carries 
W [0] , . . . ,W[n] , is recursively calculated based on 

W[i] = U[i]vV[i]W[i-l] , i > 0, W[0] = U[0] where 

U[i] = A[i]&B[i], W[i] = A[i]vB[i] and the 
5 output functions are 

S[i] = A[i]©B[il©W[i-l] , 1 < i < n, and 

S[0] = A[0]©B[0], S[n] =W[n]. 

The sequence of the carries is calculated as 
W[0] =A[0], W[i] =A[i], W[i-1] = A[i] , . . • , A[0] , for 

10 i = l,...,n"-l. The outputs represent the functions 
S[0] = -.A[0], S[i] - A[i]©W[i-l], for i = l,...,n-l, 
and S[n] = W[n-1], because vector (B[0],..., B[n-1]) 
= (1,0, ...,rO). Therefore the incrementor can be 
implemented by using the input module of the adder 

15 providing the conjunctions W[i] = A[i]&...&A[0]) and 
the output module of the adder. The carry module is 
modified by simply coupling the output of the input 
module to the XOR elements of the output module. 

It is possible to reduce the delay of this 

20 incrementor circuit by using negation instead of con- 
junctions for W[i], using XOR elements at the outputs 
instead of XNOR elements, and forming the input 
circuits of OR elements with outputs coupled to NAND 
elements of the adder. One technique for 

25 distributing negations is described in the 
aforementioned Gashkov et al. application . 

It is possible to further reduce delay by 
duplicate elements of the circuit so that elements 
performing negation functions are adjacent the 
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corresponding elements and the circuit will be 
constructed only using elements NAND and NOR. One of 
the two mutually-inverse duplicate elements (having 
minimum delay) is selected and elements that are 
useless can be deleted from the circuit. The delay 
can be further reduced if the multiple input elements 
NAND and NOR are used. 
1.5. Decrementor 

A decrementor is similar to the subtracter with 
the second operand equal to a constant 1. To 
implement a decrementor, a subtracter is arranged to 

n-l 

subtract from the number A = ^TA[i] by negation on the 

n n-l 

inputs and output, as if ^2'iS'[/] = ^2'^[/] + l that 
J^T^S\i] = l""^' " 1 - J^TS\i] = 2"^' - 1 - J^2'A\i] - 1 

(=0 1=0 1=0 

= 2"+|]2'-v4[/]-l . 

t=0 

The decrementor is implemented similar to the 
incrementor, and it is not necessary to connect 
inverters with inputs as is necessary for the 
subtracter. Nor is it necessary to connect inverters 
to the output of circuit. Instead, the decrementor 
is formed by replacing the XNOR elements of the 
incrementor with XOR elements. Hence, decrementor 
and incrementor circuits have identical topology and 
can be composed from the dual elements in the 
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aforementioned Gashkov et al. application to realize 
the dual functions. 

1.6. Incrementor-Decrementor 

The incrementor-decrementor is a circuit having 
an additional input s. If s = 0 the circuit operates 
as an incrementor ; if s = 1 the circuit operates as a 
decrementor . To construct this circuit, the inputs 
and outputs of incrementor are connected through 
additional XOR elements, each having a common input 
s. If s = 0, the circuit will operate as an 
incrementor because the additional XOR elements will 
simply pass the state of the input A[i] or S[i] bits, 
respectively. If s = 1 the circuit will operate as a 
decrementor because the additional XOR elements will 
operate as inverters. The complexity of the 

incrementor-decrementor circuit will increase by 2n 
and the depth will increase by 2, as compared to an 
incrementor. 

1.7. Absolute Value Calculator 

By convention, the sign bit of a signed number 
is the most significant bit (n-1) . If the n-1 bit is 
a "1", the number is considered negative; if the n-1 
bit is a "0", the number is considered nonnegative. 
For an absolute value calculator, if the sign bit 
A[n-1] = 0, the circuit simply passes inputs to 
outputs without any changes. If the sign bit A [n-1] 
= 1, the circuit calculates the digits of number as 

n-\ n-1 

2" -^2'^[/] = l + ^2'^[/] . Therefore, the absolute value 

/=0 i=»0 
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calculator operates as an incrementor with inverters 
on the n-1 inputs. The circuit is topologically the 
same as the incrementor. The outputs are connected 
to multiplexers controlled by the input A [n-1] so if 
A [n-1] = 0, the multiplexer outputs are passed 
without change. If A[n-1] = 1, the multiplexer 
causes the outputs of the incrementor to be passed. 
The complexity of the absolute value calculator 
circuit is increased on the n~l path, and depth 
increases by 1 over the incrementor. 
2. IMPLEMENTATION OF AN ADDER 

FIG- 1 illustrates an adder circuit consisting 
of three modules: input module 10, carry module 12 
and output module 14. Input module 10 performs the 
functions 

U[i] = A[i]&B[i] and 

V[i] = A[i]vB[i], for 0 < i < n-1. 
Instead of A[i]vB[i], it is also possible to use 
A[i]®B[i], in which case U[i] and V[i] can be 
simultaneously realized by a half-adder element. One 
embodiment of input module 10 is illustrated in 
greater detail in FIG. 2A as including AND elements 
16 and OR elements 18. 

Output module 14 provides the output functions 
of the adder of 

S[i] = A[i]©B[i]®W[i] 

= P[i]ew[i], where P[i] - A[i]®B[i], 
for 1 < i <n, S[n] = W[n]. In one embodiment shown in 
FIG. 2B, the output circuit consists of 3-input XOR 
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elements E03(a,b,c) = a©b©c. It is possible to apply 
2-input XOR elements EO(a,b) = a©b = XOR(a,b) for 
optimization if a half-adder is used for input module 
10. In such a case P[i] = A[i]©B[i] has already been 
5 implemented at the input module, and the output 
module comprises EO(a,b) elements to perform the 
function S[i] = P[i]®W[i], 

The principal module of the adder is carry 
module 12 which realizes functions h_l, . . . , h_{n+l } • 
i^, 10 For sake of brevity n will be used instead of n+1 and 

O carry module 12 will be denoted as h_n. Recursive 

generation of the carry module H_n = {h_l, . . . , h_n} is 
Si based on application of identities h {k+1} = 

O h'_lvv'_l&h_k = A021 {v'_l,h_k:,h'_l) , where h'_l = 

r 15 h_l(U[k+l] ,U[k+2] ,V[k:+2] , . . . ,U[k+l] ,V[k+l] ) , and 

v' {1} = V {l,k} = V[k+1] &. . .&V[k+l] for 1 = 
l,...,n-k. As shown in FIG. 3, in one form module 
Q H_n consists of modules H_k = {h_l,..., h_k} and 

HV_{n-k} = {h'_l,v'_l, . . . , h'_{n-k} , v'_{n-k, } } and 
20 also their connecting modules A021 that performs the 
function A021(a,b,c) = (a&b)vc, shown in FIG. 4. 

A carry module, shown in FIG. 5 and designated 
HV_n, derives carries simultaneously with 
conjunctions {h_l, v_l, . . . , h_n, v_n} , where v_k 
25 v_{k,n} = V[n] &. . . &V[n-k+l] . Carry module HV_n is 
recursively generated from modules HV_k = 
h_l, v__l, . . . ,h__k, v_k and HV__{n-k} 

h' ) 1, v'_l, . . . ,h'_{n-k} , v'_{n-k} and their connecting 
module C_n,k. Module C_n, k is shown in greater 
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detail in FIG. 6 consisting of elements AND and A021 
{FIG. 4), due to identities 

H_{k+1} = h'_lvv'_&h_k = A021 (v'_l,h_k,h'_l) , 
h'l = h_l(U{k+l},D[k+2,V[k+2],...,U{k+l},V[k-fl], 
v'_l = v_{l, k} = V[k+1]&. . .&V[k+l}, 

for 1 = 1, . . . ,n-k 
V_{k+1} = v_{l,k}&v_k 

= (V[k+1] 6c. . , &V[k+l] ) & (V[k] & . . .&V[1] ) 
= V[k+1] &. . .&V[1] . 
The optimal choice of parameter k on each step 
of the recursion for the purpose of minimizing depths 
of a circuit is the same as for a comparator 
described in the aforementioned Gashkov et al. 
application. More particularly, k is selected from 
the closed interval between F_i to n-F_{i-l} = F_i, 
where F_i is a number selected from a Fibonacci 
series. Most particularly, the circuit depth is 
minimized by selecting k so that k > F_l, n-k < 
F_{1-1}, where the number 1 satisfies F_l < n < 
F_{1+1}, and the Fibonacci series F__l is defined 
recursively from the equality F_{1+1} = F_1+F_{1-1} 
using initial values F_l = 1, F_0 = 0. The value of 
k is determined uniquely when n is equal to a 
Fibonacci number; in other cases k is selected from a 
series of natural numbers [n-F__{ 1-1 } , F_l] . k may be 
selected arbitrarily from this series. 

As in the convention adopted in the 
aforementioned Gashkov et al, application, if k is 
selected as the left extremity of a series (least 
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allowable value) , the a circuit is referred to as a 
left-side circuit; if k is selected as the right 
extremity of this series (greatest allowable value) , 
the circuit is called right-side circuit. Also as 
described in the Gashkov et al. application, it is 
possible to distribute negations and execute other 
technological mapping techniques to derive 2-input 
elements in place of 3-input elements forming 
circuits that are topologically the same based on the 
following identities 

h__{k+l} = -.(-.h*_l) vv'_l&h__k 

= NAND(^h'_l,ND(v'_l,h_k) ) , 

^h_{k+l} = ^(h'__lvv'_l,h_k) 

= A06(v'_l,h_k,h'_l) , and 

V__{k+1} = v_{l,k}&v_k, where A06(a,b,c) 
NOR(AND(a,b) ,c) , 

In addition to using elements to form 
OR (NOT (a), b) in place of XNOR(a,b) elements, it is 
preferred to employ NOR and NAND elements instead of 
monotone OR and AND elements to reduce circuit depth 
and delay. 

3. OPTIMIZATION TECHNIQUES 
3.1, Distribution of Negations 

For brevity symbol f^'a designates the function f 
or its negation depending on whether a = 1 or 0« A 
circuit with two outputs that realizes functions h^a_n 
and v^'p^n may be defined as h^'av^'p, where a==0,l and 
P=0,1. 
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GP[a_l] [a_2] [p_l] [|5_2] [p_3] [p_4] designates a 
module with two outputs h and v and four inputs a, 
c, d, realizing functions A02r { a_l } (b^^i p_2 } , c'^fp^S}, 
a'^{p__l}), AND^{a_2} (b^{p_2}, and d''{p_4}), where 
5 A021{a,b,c) = OR (AND {a,b) , c) (FIGS. 4 and 7B) . For 
each of 64 modules^ the optimal implementation is 
selected based on area or delay. As another example, 
module GP [0] [0] [1] [1] [1] [1] may consist of elements 
A06 and NAND, where A06(a,b,c) = NOR (AND (a, b) , c) . 
10 See FIG. 7B. FIG. 8 illustrates a connecting module 

g GP. 

hi A module H"" {a_l } V {a_2 }_n may be defined to 

Si 

provide functions h'^la 1} n and v^{a 2} n, where a_l 
p5 =0,1 and a_2 = 0,1, and functions 

f 15 h_l, v_l, . . . ,h_{n-l} , v{n-l} . This module is generated 

hi recursively from modules H^{p_l}V^{P_2}_{k}, and 

ffj H'^{P_3}V^{p_4}_{n-k}, and their connecting module 

2 GP[a_l] [a__2] [p__l] [p_2] [P_3] [p_4] . Similarly, a module 

H^a_n as may be defined to provide function h'^a_n, 
20 where a = 0,1, and functions h_l, . . . , h_{n-l } , This 

module is recursively generated from modules 

H^{P_l}V^{p_2}_k and { p_3 }__{n-k} and their 

connecting module AO [a] [p_l] [P_2] [p_3] . 

The optimal distribution of negations is carried 
25 out as described in the aforementioned Gashkov et al. 
application. 
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3.2. Minimization of adder depth 

Adders are frequently used as the internal modules 
in the more complicated circuits, such as 
multipliers. To calculate the depth of outputs of 
5 the adder located as an internal module in a larger 
circuit, it is not practical to assume that the depth 
of the adder inputs is equal to zero. 

The depths of inputs A[0] , B [0] , . . . , A[n-1] , B [n-1] 
of the adder are defined as a_0,b_0, — ,a_{n-l}, 
10 b_{n-l} . The depth of the element providing 

p functions U[i] = A[i]&B[i] and V[i] - A[i]vB[i], for 

H 0 < i < n-1, is equal to U_i = max{a_i, b_i}+l, for 

2; 0 < i < n-i. Therefore, the problem of minimizing the 

yj 

O depth of an adder is reduced to one of minimizing the 

ni 

15 depth of circuit HV_n, realizing all functions 
^ h_l f v_l , - . - , h_n, v_n . 

Minimization of the depth of circuit HV_n is 

fli 

carried out, by considering U[i] = 1, for 0 < i < n-i 
H with the help of the optimal choice of parameter k in 

20 each step of recursive construction of HV_n from 
modules HV_{k}, HV_{n-k} and their connecting 
modules. The connecting modules consist of the 
parallel modules GP which are assumed, for example, 
to be formed from 2-input elements. 

25 DH_n is the maximum depth of all the outputs of 

module HV_n, and Dh_n is the depth of the output of 
module HV_n that realizes function h_n. A dynamic 
programming algorithm is applied to minimize the 
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depth, based on the recursive application of the 
formulas 

DH_n = inin{niax{DH_{n-k}+l,Dh_k+2,DH_k}}, 
Dh_n = min{max{Dh_{n-k}+l,Dh_k+2} }, 
5 where DH_k and Dh_k are calculated for vectors of 
depths U_i, for 0 < i < k-1, and DH_{n-k} and Dh_{n-k} 
are calculated for vectors of depths U_i, for k < i < 
n-1. If the minimum value of both DH_n and Dh_n are 
achieved for the same value of parameter k, that 
10 value is selected. Otherwise (if the minimiim value 

i.' s 

Q of both DH__n and Dh_n are achieved for different 

T^'f values of parameter k) , it is possible to accelerate 

H the algorithm (at the expense of a possible 

Q diminution of exactitude) by always selecting the 

15 smallest value of k. 
hh If exactitude is necessary, a vector DHh_n = 

pi 

I'l (DH n,Dh n) is calculated during each step of 

nj recursion on the earlier calculated vectors 

2 DHh_{n-k} , DHh_k. The calculation employs the 

20 formulas DH_n = max{ DH_{n-k}+l, Dh_k+2, DH_k} and 
Dh_n = max{ Dh_{n-k} + l, Dh_k+2 } for the components of 
vector DHh_n. The values k are sorted to derive a 
set of minimum vectors DHh_n (concerning the relation 
of component-wise comparison) . In further steps of 
25 the recursion, a search is conducted for a set of 
minimum vectors. The search examines all values of 
parameter k as well as all minimum vectors 
DHh_{n-k} , DHh_k obtained on the previous step of 
recursion. 
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Vector DHh_k is recursively calculated for each 
k-dimension subvector (u_a, . . . , u_{b+k-l} ) to 
calculate sequences of vectors DHh__n for a specific 
vector (u_0,,,., u_{n-l}). In the simplest 

5 implementation of this algorithm the memory is equal 
to O(n^), and the complexity is equal to 0(n"^). 
3,3> Minimization of Adder Fanout Depth 

If the depth of a circuit is defined as the length 
of its maximum chain of elements in which the 
10 adjacent elements are connected to one another, the 
fanout depth can be defined as the maximum sum of 
Q fanouts of elements of such circuits, where the 

fanout of an element E is the number of elements 
SI having inputs connected to an output of element E. 

pi 15 The delay of element E depends on the total 

capacitance of the indicated elements connected to 
pj it. The delay of circuit with given depth will be 

less if the fanout depth is less. Therefore to 

ly 

CS minimize the delay of a circuit, the fanout depth 

20 should also be minimized. 

Smaller fanout depth can be achieved by changing 
the adder implementation approach (described in the 
previous section) . The following describes 

implementation of an adder with smaller fanout depth, 
25 without changing the depth and only minimal increase 
in circuit area. 

Consider a system of functions 
HV'"k_n = {h^l_k,v^l_k, . . . ,h^{n-k+l}_k, 
V^in-k+l} k}. 
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where 

h"i_k = h_k(U[i-l] ,U[i] ,V[i] , . . .,U[i+k-2] , 
V[i+k-2] ) ,v'^i_k 
= V[i-l]&V[i]&. . .&V[i+k-2] . 
It is possible to recursively construct circuit 
HVk_n from modules HV^m_{n- (k-m) } and { k-m}_{n-m} 
and their connecting module, consisting of parallel 
located modules GP, based on the expansions 

H^i_k = h_k(U[i-l] ,U[i] ,V[i] , . . .,U[i+k-2] , 
V[i+k-2] ) 

=h'^ { i+m}_{ k-m} vv"^ { i+m}_{ k-m} Sh^'i^m, 
v^i_m = V[i-l]&V[i]&. . .&V[i+k-2] 

= v''{i+m}_{k'^m}&v'^i_m. 
The resulting circuit is illustrated in FIG. 9. If k 
= F__l, the number m in the next step of a recursion 
can be selected as F_{1-1}, where {F_l} is the 
Fibonacci series. Then using mathematical 

inductions, the depth of circuit HVk_n will be equal 
to 1-1, fanout-depth will be equal to 31-5, and the 
complexity of this circuit (the number of elements) 
is equal to 3n(l-2). 

Fanout of each element realizing the function 
v'^ilFJ} is equal to 3, and fanout of elements 
realizing the functions h^i_{F__j} is equal to 2. 
Each step of the recursion is performed using the 
same algorithm described in the aforementioned 
Gashkov et al. application to minimize parameter m. 
The circuit H_n is constructed recursively from 
modules H K and HV^k n and their connecting module, 
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consisting of parallel located modules GP, founding 
on the expansion 

h_i = (U[0],U[l],V[l],...,U[i-l],V[i-l]) 

= h^{i-k+l}_kvv'^{i-k+l}_k&h_{i-k}, i > k. 
The depth of this circuit can be calculated as 



and fanout-depth can be calculated based on a similar 



calculated fromL{n) = 3n (1-2) +L (k) , where 

k = F_l < n < F_{1+1}^ for F_l Fibonacci numbers. 

Therefore, the asymptotic inequality 
L(n) < 3(2+(p)n log_9 (n) , 



is obtained, where cp = (V5+iy2 is an irrational 
number . 

The adder based circuit is constructed using 2- 
input elements mapped from 3-input elements of a 
standard cell library. In the event 4-input elements 
are needed (as described in the aforementioned 
Gashkov et al. application), they may be constructed 
from 2-input elements. 

The present invention thus provides an adder 
based circuit embodied in an integrated circuit. The 
adder includes an input module, a carry module and an 
output module. The carry module has a minimum depth 
defined by a recursive expansion of at least one 
function associated with the carry module based on a 
variable k derived from a Fibonacci series. Through 
use of inverter elements, XOR elements, XNOR elements 
and multiplexers selectively coupled to the input and 



DH n = maxiDHV-^k n + 1, DH_{n-k}-h2}, 



formula. 



The complexity of this circuit can be 
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output modules, the adder based circuit can be 
configured to function as a subtracter, adder- 
subtractor, incrementor, decrementor, incrementer- 
decrementor or absolute value calculator. 

The process of designing the adder based circuit 
includes defining at least one carry function of the 
carry module of the adder based circuit in terms of a 
Fibonacci series, and recursively expanding the carry 
function to find a minimum parameter of the Fibonacci 
series. Optimization of depth and delay of the adder 
based circuit are achieved. 

In one form, the invention is carried out though 
use of a computer programmed to carry out the 
process. A computer readable program code is 
embedded in a computer readable storage medium, such 
as a disk drive, and contains instructions that cause 
the computer to carry out the steps of the process of 
designing a Fibonacci circuit in the form of a adder 
based circuit, including adders, subtracters, adder- 
subtractor s , incrementor s , decrementor s , incr ementor- 
decrementors and absolute number calculators. 

Although the present invention has been 
described with reference to preferred embodiments, 
workers skilled in the art will recognize that 
changes may be made in form and detail without 
departing from the spirit and scope of the invention. 



